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Amendments to the Claims: 

This listing of claims will replace all prior versions and listings of claims in the application: 
Listing of Claims: 

Claim 1 (currently amended): A method for searching a collection of items, wherein each item 
in the collection has a set of properties, comprising the steps of: 

obtaining a query composed of a first set of one or more properties; and 

obtaining a result based on applying a distance function to one or more of the items the 
query and an item in the collection having a second set of one or more properties , wherein 

obtaining a result includes determining a third set of properties common to the 
first set of one or more properties and the second set of one or more properties, and 

the distance function determines a distance between the query and an item in the 
collection based on the number of items in the collection that are associated with all of the 
properties in the intersection of th e first set of properti e s and the s e t of properti e s for the it e m 
third set of properties . 

Claim 2 (original): The method of claim 1, further including the step of associating each item in 
the collection with a set of properties. 

Claim 3 (currently amended): The method of claim 1, wherein the step of obtaining a result 
includes identifying one or more result items whose distance from the query is within a first 
threshold. 

Claim 4 (currently amended): The method of claim 3, wherein the step of obtaining a result 
includes ranking the one or more result items according to their distance from the query. 

Claim 5 (original): The method of claim 3, wherein the threshold is defined as a number of 
result items. 

Claim 6 (original): The method of claim 3, wherein the threshold is defined as a distance. 

Claim 7 (original): The method of claim 1, further including the step of returning the result. 
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Claim 8 (original): The method of claim 1, wherein the step of obtaining a query includes the 
step of mapping a received query to a set of one or more properties. 

Claim 9 (original): The method of claim 1, wherein one or more of the properties are binary. 

Claim 10 (original): The method of claim 1, wherein one or more of the properties are related by 
a partial order, and wherein, if an item is associated with a property, then the item is also 
associated with all ancestors of that property in the partial order. 

Claim 1 1 (currently amended): The method of claim [[6]] K), wherein one or more of the 
properties represent numerical values or ranges, and wherein the partial order reflects a set of 
containment relationships among the numerical values or ranges. 

Claim 12 (original): The method of claim 1, wherein the properties are grouped into equivalence 
classes. 

Claim 13 (original): The method of claim 12, further including the step of grouping the 
properties into equivalence classes using clustering. 

Claim 14 (original): The method of claim 13, wherein each property has a set of subproperties, 
wherein the clustering is performed such that the distance between two properties in the 
collection is correlated to the number of properties in the collection that are associated with all of 
the subproperties common to both properties. 

Claim 15 (original): The method of claim 1, wherein the query corresponds to a single item in 
the collection. 

Claim 16 (original): The method of claim 1, wherein the query corresponds to a plurality of 
items in the collection. 

Claim 17 (original): The method of claim 1 7 wherein the query is independent of the items in the 
collection. 

Claim 18 (original): The method of claim 1, wherein the step of obtaining a result is constrained 
to a subcollection of the items in the collection. 

Claim 19 (original): The method of claim 18, wherein the subcollection is specified as an 
expression of properties. 
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Claim 20 (original): The method of claim 19, wherein the expression includes a subset of the set 
of properties that compose the query. 

Claim 21 (original): The method of claim 1, wherein the step of obtaining a query includes 
identifying certain properties to be ignored in the step of obtaining a result. 

Claim 22 (original): The method of claim 1, wherein the distance function is applied explicitly. 

Claim 23 (original): The method of claim 1, wherein the distance function is applied implicitly. 

Claim 24 (original): The method of claim 23, wherein the step of obtaining a result includes the 
step of iterating a random walk process to select potential result items. 

Claim 25 (original): The method of claim 24, wherein the step of obtaining a result includes 
ranking the potential result items by frequency and selecting the potential result items having 
higher frequencies. 

Claim 26 (original): The method of claim 23, wherein the step of obtaining a result includes 
iterating through one or more subsets of the query and identifying items associated with the one 
or more subsets. 

Claim 27 (original): The method of claim 26, wherein the one or more subsets are prioritized 
according to the number of items in the collection that have all of the properties in each subset 
and wherein iterating through one or more subsets of the query is continued until a first threshold 
is reached. 

Claim 28 (original): The method of claim 1, wherein the step of obtaining a result includes 
applying a Euclidean distance function. 

Claim 29 (original): The method of claim 28, wherein the step of obtaining a result includes 
merging a first result determined by applying the distance function and a second result 
determined by applying the Euclidean distance function. 

Claim 30 (original): The method of claim 28, wherein the step of obtaining a result includes 
determining a first result by applying either the distance function or the Euclidean distance 
function and applying the other distance function to the first result. 
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Claim 31 (currently amended): A method for analyzing two sets of properties from a plurality of 
sets of properties, comprising the steps of: 

determining a set of common properties in the intersection of common to the two sets of 
properties; 

determining the number of sets of properties from the plurality of sets of properties that 
include the set of common properties; and 

assessing the distance between the two sets of properties as a function of the number of 
sets of properties that include the set of common properties. 

Claim 32 (original): A method for analyzing the relationship between two items in a collection 
of items, wherein each item in the collection is associated with a set of properties, comprising the 
steps of: 

obtaining a set of properties with which the two items are commonly associated; and 

determining the degree of commonality between the two items as a function of the 
number of items in the collection that are associated with all of the properties with which the two 
items are commonly associated. 

Claim 33 (original): A computer program product, residing on a computer readable medium, for 
use in searching a collection of items, the computer program product comprising instructions for 
causing a computer to: 

receive a query composed of one or more properties; and 

obtain a result based on applying a distance function to one or more items the query and 
an item in the collection having a second set of one or more properties , 

wherein the distance function determines a third set of properties common to the first set 
of one or more properties and the second set of one or more properties, and determines a distance 
between the query and an item in the collection based on the number of items in the collection 
that are associated with all of the properties in the intersection of th e fir s t s e t of properties and 
the set of properties for the item third set of properties . 
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Claim 34 (original): The computer program product of claim 33, wherein the instructions cause 
the computer to obtain a result by identifying exactly the items whose distance from the query is 
within a threshold. 

Claim 35 (original): The computer program product of claim 33, wherein the instructions cause 
the computer to obtain a result by identifying approximately the items whose distance from the 
query is within a threshold according to a heuristic. 

Claim 36 (original): The computer program product of claim 35, wherein the heuristic permits a 
trade-off between the accuracy and the performance of a search. 

Claim 37 (original): The computer program product of claim 35, wherein the heuristic includes 
the use of a random walk process. 

Claim 38 (currently amended): A computer system for managing data records comprising: 

an information retrieval subsystem that stores and retrieves data records, each data record 
being associated with a set of properties; and 

a similarity search subsystem that receives similarity search queries and processes 
similarity search queries based on a distance function, a similarity search query being associated 
with a first set of properties, 

wherein the distance function determines a distance between the query and a data record 
in the collection having a second set of properties based on determining a third set of properties 
common to the first set of properties and the second set of properties, and determining the 
number of data records in the collection that are associated with all of the properties in the 
int e rsection of th e first set of properties and the set of properties for the data r e cord third set of 
properties . 

Claim 39 (original): The computer system of claim 38, further including a clustering-subsystem 
that employs the distance function of the similarity search subsystem to construct a graph. 

Claim 40 (withdrawn): A method for applying a matching algorithm to a collection of items, 
each item being associated with a set of properties, comprising the steps of: 



BOSTON 1 87423 lvl 



8 Of 11 



Appl. No.: 
Reply Dated: 
Office Action of: 



10/027,195 
March 22, 2004 
September 25, 2003 



Atty. Docket No. 109878.125 



constructing a graph having nodes that correspond to items, and having edges that 
correspond to pairs of items, wherein each edge has a cost correlated to the number of items in 
the collection that are associated with all of the properties in the intersection of the sets of 
properties for the two items that the edge links; and 

identifying a subset of the edges that constitutes a minimum-cost matching with respect 
to the graph. 

Claim 41 (withdrawn): A method for applying a clustering algorithm to a collection of items, 
each item being associated with a set of properties, comprising the steps of: 

constructing a graph having nodes that correspond to items, and having edges that 
correspond to pairs of items, wherein each edge has a cost correlated to the number of items in 
the collection that are associated with all of the properties in the intersection of the sets of 
properties for the two items that the edge links; and 

identifying a collection of subsets of the edges that constitutes a minimum-cost clustering 
with respect to the graph. 
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